Finding Cross-Object Relationships from Large Databases
نویسندگان
چکیده
While traditional association rules demonstrate strong potential values such as to improve market strategies for retail industry, they are limited to finding associations among items within the same transaction. Consider a database of supermarket transactions, the traditional association rules can represent such knowledge as “80% of customers who buy Chinese tea also buy teapot at the same time.” However, they fail to represent some more interesting rules like “If a customer buys Chinese tea, s/he may most likely buy teapot within 3 days”, where the association may span across different transactions. To capture this contextual semantics which are also vital to the validation of associations, in this study, we introduce the notion of cross-object relationships. Two algorithms for mining cross-object association rules from large databases are developed by extension of Apriori algorithm. We show that traditional associations can be treated as a special case of cross-object relationships from both conceptual and algorithmic point of view.
منابع مشابه
Extending the Qualitative Trajectory Calculus Based on the Concept of Accessibility of Moving Objects in the Paths
Qualitative spatial representation and reasoning are among the important capabilities in intelligent geospatial information system development. Although a large contribution to the study of moving objects has been attributed to the quantitative use and analysis of data, such calculations are ineffective when there is little inaccurate data on position and geometry or when explicitly explaining ...
متن کاملEngineering truly automated data integration and translation systems
This thesis presents an automated, data-driven integration process for relational databases. Whereas previous integration methods assumed a large amount of user involvement as well as the availability of database meta-data, we make no use of meta-data and little end user input. This is done using a novel join and translation finding algorithm that searches for the proper key / foreign key relat...
متن کاملiProClass: an integrated, comprehensive and annotated protein classification database
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. The iProClass currently consists of more than 200,000 non-redundant PIR and SWISS-PROT proteins org...
متن کاملEfficient Mining of Cross-Transaction Web Usage Patterns in Large Database
Web Usage Mining is the application of data mining techniques to large Web log databases in order to extract usage patterns. A cross-transaction association rule describes the association relationships among different user transactions in Web logs. In this paper, a Linear time intra-transaction frequent itemsets mining algorithm and the closure property of frequent itemsets are used to mining c...
متن کاملLarge-scale interoperability of legacy object bases
In this paper, we present a truly object-oriented approach to large-scale interoperabil-ity of heterogeneous and autonomous databases. The problem with such an environment is that users typically have limited insight into the semantics of data deened in foreign databases. We show that existing schema integration techniques are not quite suited to such environments, due to diiculties in determin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004